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THE SQUAREFREE FLOW 



RYAN PECKNER 



(N 

o 

(N 



> 
O 



Abstract. The squarefree flow is a natural dynamical system whose topological and 
ergodic properties are closely linked to the behavior of squarefree numbers. We prove 
that the squarefree flow carries a unique measure of maximal entropy and describe 
the structure of the associated measure-preserving dynamical system. Our method in- 
, volves first studying approximations arising from finite collections of prime numbers, 

then taking a limit under Ornstein's d-metric in order to consider all primes simulta- 
neously. This is accomplished by proving uniform Gibbs bounds for a sequence of sofic 
systems and constructing explicit joinings between them in order to directly estimate 
their d-distances. 
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1. Introduction and Background 

One of the most important objects in analytic number theory is the Mobius function, 
defined for positive integers n by 



m , f 1 if n = 1 

^ ' n(n) = < if n is not squarefree 

| It is well-known that if the sequence (/x(n)) n>1 behaves randomly in the sense that 

O ! Yl = Oe (iV 1/2+e ) for any e > 0, 

CN ' n<N 

then the Riemann hypothesis is true. Less quantitative but richer reflections of the 
chaotic behavior of [i are captured by instances of the "Mobius Randomness Law", see 
^ . for example [IK] . 

Sarnak has recently developed an approach to the idea of Mobius randomness based 
on the study of certain dynamical systems |Sarj . Let = {—1, 0, 1} N with the product 
topology, and let T : 9,^) _^ q(3) be the left shift defined by (Tx) n = x n+1 . (rt 3 \T) 
is a topological dynamical system, that is, a pair (X, S) where X is a compact metric 
space and S : X — > X is continuous and surjective. 

The Mobius function defines a point co = (/x(l), /x(2), /x(3), . . . ) G and we let M. 
be the closure in of the orbit {T k u : k > 0}. Then the 'Mobius flow' M. is a closed, 
T-invariant subset of , but due to the mysterious behavior of /j, one cannot say much 
else about it. The purpose of this paper is to study a related system about which we 
can say a great deal. 

Let = {0, 1} N and let 4> : Q,^ ->• 0,^ be the squaring map (x n ) H> (a^) . Observe 
that (ft is continuous, surjective, and intertwines the shift maps on the two spaces (this 
is an example of a factor map between topological dynamical systems). Let 5 be the 
closure in f^ 2 ) of the orbit of (p(uj) = (// 2 (n)) n>1 under T. Since fJ- 2 (n) is the indicator 

function of the set of squarefree numbers, we call (S,T) the squarefree flow. 

l 
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Naturally, the dynamical structure of S is strongly tied to the statistical properties of 
squarefree numbers. In this connection, Sarnak uses the well-known squarefree sieve (cf. 
|Tsaj ) to produce a certain ergodic measure v of zero entropy on whose support is 
S (for the definitions of terms from ergodic theory see [Gla] ) . This allows him in turn 
to prove the following: define the support of a sequence x = (x n ) € f^ 2 ) to be the set 
supp(x) C N of those n such that x n = 1. We call a subset AcN admissible if for every 
prime p the reduction A mod p 2 is a proper subset of Z/p 2 Z. We then call a sequence 
(x n ) E 0( 2 ) admissible if its support is an admissible subset of N. It should be clear that 
{pL 2 {n)) n >\ is admissible in this sense. 

Let A C 0( 2 ) be the set of all admissible sequences. It's easy to show that A is 
a closed, shift invariant subset of Q^ 2 '. Using the aforementioned measure v (and in 
particular the fact that its support is S), Sarnak is able to show that in fact .4 = 5. 
Though this may seem surprising at first, it arises quite naturally from the shape of 
the measure u, which in turn is constructed from the main term of the squarefree sieve 
formula. 

The ergodicity of v implies that the subset of A consisting of sequences whose support 
omits only one residue class modp 2 for every p has full i/-measure. It's then not hard to 
show that the map which sends such a sequence to the point in JC := \\ v TLjp 2r L whose 
p-th coordinate is the residue class omitted by the sequence modp 2 intertwines the shift 
on A with the translation on K, by the element (—1,-1,-1,...) (note that this map 
is Borel measurable but not continuous). Moreover, the pushforward of v under this 
map is precisely the mass one Haar measure on the compact group /C, so we have a fac- 
tor map of measure-preserving dynamical systems (A, T, v) — > (/C, T(_ 1 _ 1 _ lj ...), z^Haar)- 
Subsequently, Cellarosi and Sinai [CS^ used spectral theory techniques to prove that this 
map is in fact an isomorphism of measure-preserving dynamical systems (it is however 
very far from being an isomorphism of topological dynamical systems). 

While this tells the whole story for the measure there are other invariant measures 
on A that should be taken into account. Specifically, as stated by Sarnak (and proven in 
this paper), the system (A,T) has topological entropy (6/-7r 2 )log2 (it's no coincidence 
that 6/-7T 2 is the density of the squarefree numbers in N!). The variational principle ( [Gla] 
Thm. 17.1) states that the topological entropy of a topological dynamical system is the 
supremum of the measure entropies over all its invariant probability measures, and it is 
known that any subshift system possesses at least one measure of maximal entropy, i.e. 
an invariant probability measure whose measure entropy equals the topological entropy 
of the system. Thus there exists such a measure for A. 

While measures of maximal entropy always exist for subshifts (this is not the case 
for all topological dynamical systems), the number of such measures may be arbitrarily 
large, even infinite (see |Hay| for examples). In keeping with the general principle 
in ergodic theory that a scarcity of measures is more meaningful than an abundance 
of them, it is an important problem to determine when there is only one measure of 
maximal entropy on a given topological dynamical system. This uniqueness can be 
of great practical value. For instance, the existence of a unique measure of maximal 
entropy for the geodesic flow on the modular surface is a key ingredient in the proof of 
Linnik's theorem on the representations of integers by integral binary quadratic forms 

m- 

The aim of this paper is to prove the following. 

Theorem (Thm. 13. lOj) . The squarefree flow A possesses a unique measure of maximal 
entropy. 
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Our proof relies on a detailed study of the finite-level systems 

A{r) := {x € f^ 2 ^ : supp(x) mod pf is a proper subset of 'L/pf'L for i = 1, . . . , r} 

where p\,...,p r are the first r primes. Our first result (Prop. 12. 51 below) is that each of 
these systems is a so-called sofic shift, meaning they satisfy a certain finitary condition 
that controls the complexity of the words appearing in the shift (see section [272]) . Such 
shifts were first investigated by Weiss ( |Weij ). who proved that a subshift is sofic if and 
only if it is a factor of a subshift of finite type (none of the A(r) is of finite type) . It should 
be mentioned that although Weiss was the first to give an explicit definition, versions of 
sofic systems were known perhaps long before him to researchers in theoretical computer 
science; indeed, sofic systems can also be characterized as essentially those subshifts that 
are recognizable by a finite automaton (see the excellent book [LM] for this perspective) . 

Sofic shifts are among the few classes of systems that are known to carry a unique 
measure of maximal entropy. However, while the shifts Air) are all sofic the full square- 
free system A = (~l^ =1 ^4(r) is not, so we have to take a different approach to proving that 
it has a unique maximal measure. Naturally, we do this by exhibiting A as a certain 
limit of the systems A(r) with their measures of maximal entropy, specifically as their 
limit under Ornstein's d metric (see section [3751 for precise definitions). Philosophically, 
this metric judges the closeness of two measures on a given shift space by asking how 
much on average one has to change a generic sequence for the first measure in order 
to get a generic sequence for the second measure. Limits in d are much more complex 
than weak* limits. In particular, the ergodic measures in the set of invariant probability 
measures on a shift space are weak* dense but closed in d, and this is one of the key 
facts we use to establish the intrinsic ergodicity of A. 

Our proof that the systems A(r) with their respective unique measures of maximal 
entropy ij r converge under d to (A, rf) (where rj is the weak* limit of the r] r ) proceeds by 
explicitly constructing certain joinings between these systems that reflect their common 
algebraic origin. More precisely, we show the following in section 13.21 

Proposition (Prop. 13. 9p . For each r > 1, there exists a natural factor map (j) r : 
(Zr x n( 2 \m r x B(\,\)) -)• (A(r),r] r ). 

In the above, m r denotes the counting measure on 7L r (equipped with translation 
by (-1, -1, . . . , -1)) and B{\, \) is the maximum entropy Bernoulli measure on 
We therefore get the joining p rs := (ir rs o (f> r x ^ s )„, (m s x B{^,\)) of (A(r),rj r ) and 
(„4(s), rj s ) (where 7r rs : Z s x — > 7L r x 0( 2 ) is the natural projection), which allows us 
to show that the d-distance between them is small. In fact it yields the following bound 
for the ci-distance, which reflects the algebraic origin of the systems in terms of squares 
of the prime numbers p\ < pi < ■ ■ ■ . 

Proposition ( §3.2[) . For any s > r we have 

7/ . 1/^1 1 1 \ 

d(Vr,Vs) ^2\n^ + n^ + '" + rfr 
z \Pr+l Pr+2 PsJ 

This shows that d(i] r ,r] s ) — > as r, s — > oo. 

While these constructions are quite specific to our setting, they reflect a more general 
principle that certain properties (in this case, intrinsic ergodicity) are preserved under 
taking the limit of a rapidly converging (in a suitable sense) decreasing sequence of 
systems with that property. Similar ideas are explored from a very different direction 
in [Gurj . In this vein, we prove a uniformity in the so-called lower Gibbs bounds for 
the measures i] r , which effectively states that, up to a constant, the ^-measure of a 
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cylinder set defined by a word in A(r) is determined by the word's length. This is a 
much stronger form of the asymptotic equipartition property guaranteed for any ergodic 
measure by the Shannon-McMillan-Breiman theorem (cf. |Gla| §14.5). Specifically, we 
show the following by closely analyzing the definition of the measure of maximal entropy 
for a sofic system in terms of the Perron- Frobenius eigenvector of a certain adjacency 
matrix ( |Weij ). 

Proposition (Prop, \2.8\i . There exists a constant L > such that for any r > 1 and 
any word w £ W(A(r)) of length n <p\ - ■ ■ p 2 , we have r] r (w) > L ■ e nh ^ r \ 

This uniformity implies that the measure 77 on A which is the weak* limit of the 
measures r] r also satisfies a lower Gibbs bound. When combined with the ergodicity of 
77, which follows from its being the d-limit of the ergodic measures rj r , this implies by a 
well-known argument that it is the only measure of maximal entropy on A (cf. 13. lip . 

The existence of the factor maps 4> r above, combined with results of Shields and 
Thouvenot on entropy zero x Bernoulli processes, yields the following clean description 
of the maximal entropy measure-preserving dynamical systems, i.e. the systems A(r) := 
(A(r), T, 7] r ) and A := (A, T, 7/). 

Theorem ( §3.4p . We have natural isomorphisms A.{r) = Z r x B r and A = K, x B sq , 
where B r resp. B is the Bernoulli system of entropy equal to that of A(r) resp. A. 

This shows that the very natural system A exhibits an unusually tractable structure, 
in addition to providing a nontrivial concrete setting in which the properties of sofic 
systems, joinings and the <i-distance can be fruitfully applied. 

I would like to thank my advisor Peter Sarnak for numerous helpful discussions. 

2. Finitely many primes 

2.1. The finite- level systems. Let A be the squarefree flow as defined above, and for 
each prime p let 

A(p) = {ye ft {2) : k P (supp(y))| < p 2 } 

and for 1 < k < p 2 — 1 let 

A k {p) = {y € ft (2) : |7r p (supp(y))| = p 2 - fc} . 

Here ir p : Z — > Z/p 2 Z is the projection. Each Ak{p) is a Borel subset of A(p), and one 
can check that A\{p) is open and dense in A{p). Ak(p) isn't quite invariant (consider 
(1,0,0,0, ...)); we should rather consider the set 

|y € r^ 2 ^ : |7T p (supp(y))| = p 2 — k and b{p 2 ) n supp(y) is infinite whenever it's nonempty 

where b(p 2 ) denotes the arithmetic progression {b + £p 2 : £ > 0} for 1 < b < p 2 . This set 
is Borel and invariant (but neither open nor closed) and properly contained in Ak(p), 
but Poincare recurrence implies that their difference has absolute measure zero (i.e. has 
measure zero for any T-invariant measure on A(p)), so we consider them to be the same 
unless otherwise stated. 

Observe that for any p we have up to absolute measure zero 

Ap) = II Mp)- (2.1) 

fc=l 
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It's important to note that the Ak(p) are not compact, and so not suitable for studying 
the topological properties of A(p). However, they will be useful later when considering 
ergodic measures, where only the Borel structure is needed. 

Proposition 2.1. For any p we have 



h t0 p{A{p)) = 




Proof. We use the following well-known formula for the topological entropy of a subshift 
X C fi( 2 ) (it appears as Cor. 14.7 in |Glaj but is misstated there): letting u n denote the 
number of distinct words w = W\W2 ■ ■ ■ w n of length n on the alphabet {0, 1} that have 
form Wi = x(i) for i = 1,2, n for some x G X, we have 

h top (X) = lim -logu n . 

n— >oo n 

In our setting, a sequence w G {0, l} m P 2 has form vj{ = y(i) for some y G A(p) if and only 
if there exists some a G {l, 2, ...,p 2 } (there may be more than one) such that w a+ £ p 2 = 
for £ = 0, 1, 2, m — 1. The remaining entries of w may be chosen freely from {0, 1}. 
Hence, each choice a of an element of {l,2, ...,p 2 } yields 2 mp ~ m = 2 m ( p distinct 
admissible words of length n = mp 2 . However, a word may be admissible for more than 
one choice of a, so we get the inequality 

u mp2 < p 2 2"V-D 

and therefore 



hto P (A(p)) = lim ^\ogu mp 2 

m-^oo mp 



< lim 



m— >oo mp 2 



log p 2 



2 9 m(p 2 -l) 



m(p 2 — 1) 
lim yF ' log 2 

•n— >oo mp 

log 2. 

P 2 ) 

But we also have the opposite inequality: since for a fixed choice a there are precisely 
2?n(p -i) worc ; s f length mp 2 having only zeros along the length m arithmetic sequence 
modp 2 on a, we have 

u rnp2 > 2 m ^ 2 " 1 ) 
from which we get ht op (A(p)) > (1 — 1/p 2 ) log 2, as desired. 

□ 

Since the Ak(p) are disjoint and invariant, for each ergodic measure rj on A(p) there 
exists a unique k such that i](Ak(p)) = 1 while r](Aj (p)) = for j ^ k, and so h v (A(p)) = 
h v {Ck{p)) where Ck{p) is the closure of Ak{p)- 

Proposition 2.2. For any prime p and 1 < k < p 2 — 1, we have 

h top (C k (p)) = (l-^log2. 
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Proof. For any invariant subset X C the closure X is also invariant and has the 

same language as X, i.e. W(X) = W(X) ( |Tho| pg. 3561). It then follows from the 
above proof that 

h top (X) = lim - log u n 

n— >oo n 

where u n is the number of words of length n appearing in X. An easy extension of the 
argument in the above proposition shows that for Ak(p) we have the inequalities 

2 ™(p 2 "fc) < Ump2 < (P^j 2 m(pl-k) 

and since ( p , ) is independent of m, the desired result follows immediately. □ 

It follows from this and the proposition that if r/ is an ergodic measure on A(p), then 
T] has maximal entropy if and only if its restriction to A\{p) has maximal entropy (we're 
using the fact that h v (Ak(p)) < h top (Ck(p)) for any invariant measure 7] on Ak(p))- 
In particular, ij must be supported on A\{p). Now suppose that r] is any invariant 
(not necessarily ergodic) measure of maximal entropy on A(p). Then by Thm. 15.12 
of |Gla] we have h v (A(p)) = j Y h r}y (A(p))di'(y) where r] = f Y r} y dis(y) is the ergodic 
decomposition of i], so r] y is an ergodic measure of maximal entropy on A(p) for z^-a.e. 
y € Y. The above then implies that rj y is supported on A\{p) for v-a.e. y £ Y, and it 
follows that i] is supported on Ai(p) as well. 

We now consider the situation for more than one prime. Let p\,...,p r be a finite 
collection of primes and set 

A(pi,...,p r ) = {y eft 2 : |7T p .(supp(y))| <pj,i = 1,2, ... ,r} 

= nl =1 A( Pi ). 

Also, for each r-tuple k = (ki) G ni=i{l> 2, — 1} define 

A^{pi, . . . ,p r ) = {y £tt 2 : |vr Pi (supp(y))| = pf - h, % = 1, 2, . . . , r} 

= n r i=1 A kt ( Pl ). 

We have the following obvious extension of Prop. 12.11 
Proposition 2.3. For any p±, . . . ,p r and k as above, we have 

htop(C%(Pl, ■ ■ ■ ,Pr)) = log2jJ ^1 - 

where C^(pi, . . . ,p r ) is the closure of AAp\, ■ ■ ■ ,p r ), and 

hto P (A(pi, . . . ,p r )) 



m&xh t0 p(C k (pi, . . . ,p r )) = log2TT ( 1 - \ ] . 

k i\\ pi J 



Proof. This is an easy consequence of the above arguments, since by the Chinese re- 
mainder theorem and the inclusion-exclusion principle we have the inequalities 

2 ™nu( Pl 2 -^) < u k < (pi\(p 2 2\... (p 2 r\ 2 mnu(p*-> 



where «^ p2p 2 p 2 is the number of distinct words of length mp\p\ ■ ■ ■ p 2 . appearing in 
elements of A^(p\, . . . ,p r ). The conclusion follows at once. □ 
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As per the remarks following Prop. 12.21 it follows from this that every invariant 
measure of maximal entropy on A(pi, . . . ,p r ) is supported on Ani i)(pii • • • ,Pr), which 
we abbreviate Ai(pi, . . . ,p r )- 

2.2. Sofic systems. Let X be a closed, invariant subset of the full 2-shift (Q( 2 \T) 
and let W(X) be the set of words that appear in the elements of X. For each word 
w G W(X) define 

F(w) = {ve W(X) : wv G W(X)}, 

the so-called follower set of w. We consider F function W(X) -)■ 2 W ( X \ The 
system (X,T) is then called sofic if the range of F is finite [Wei] . This is equivalent 
to the condition that X be a factor of a subshift of finite type. It is well-known that 
a topologically transitive subshift of finite type possesses a unique measure of maximal 
entropy; this was extended to sofic systems by Fischer and Weiss. 

Theorem 2.4 ( |Fisj Thm. 4). If (X,T) is sofic and topologically transitive, then it 
possesses a unique measure of maximal entropy. 

The construction of this measure is reviewed in the next subsection. A system pos- 
sessing a unique measure of maximal entropy (which is then automatically ergodic) is 
called intrinsically ergodic. Our reason for introducing sofic systems is the following. 

Proposition 2.5. A(p) is sofic and topologically transitive. Hence, it is intrinsically 
ergodic by the above theorem. 

Proof. We begin by showing that A(p) is topologically transitive. Let Ca,b,Ca',b' C 
f^ 2 ) be basic cylinders such that U := Ca,b H A{p) and V := Ca>,b' H A{p) are both 
nonempty. We must show that there exists some n > such that U D T~ n V ^ 0. First, 
as A, B, A', B' are all finite, we may choose a large m such that max A < min(i?' + mp 2 ) 
and maxB < mm(A' + mp 2 ). We then have for any n G {0, 1, ...,p 2 — l} that 

U (~1 T~ ( - n+mp V = CAu(A'+n+mp 2 ),BU(B'+n+mp 2 ) H A(p). 

The intersection on the right will clearly be nonempty so long as the reduction mod p 2 of 
Au(A' + n + mp 2 ), which equals the reduction modp 2 of AU (A' + n), is not all of Z/p 2 Z; 
our task is thus reduced to finding an n for which this holds. Since U and V are nonempty 
it must be the case that S = Tr p (A) and S' = tt p (A') are both proper subsets of Z/p 2 Z. 
But then, if no such n existed, we would have SU(S'+n) = Z/p 2 ZVn G {0, l,...,p 2 - l}. 
Choosing x S, this would imply x — n G S' V n, and therefore |5"| = p 2 , a contradiction. 
Hence, A(p) is topologically transitive. 

We now prove that A(p) is sofic. Let W be the set of all words on the alphabet {0, 1} 
that appear in elements of A(p) . For each w = w\W2 ■ ■ ■ Wd G W (where wi G {0, 1}) 
with d > p 2 , let / = \d/p 2 \ and define 

S p (w) = {w ip2+1 w ip 2 +2 ■ ■ ■ w {i+l)p 2 G W : i = 0, / - 1} U {w fp 2 ■ ■ ■ w d }. 

That is, S p (w) consists precisely of the distinct subblocks of w of length p 2 (as well as 
the final subblock of length d — fp 2 ). Clearly, S p (w) is finite, and moreover there are 
only finitely many possibilities for S p (w) as w varies over W. 

I claim that if S p (w) = S p (w') for two words w, w' G W both of length at least p 2 (but 
not necessarily of equal length), then F(w) = F(w'). It then follows from the above 
(and the fact that there are finitely many words in W of length less than p 2 ) that A(p) 
is sofic. To prove the claim, suppose S p (w) = S p (w') and let v = v\ ■ ■ ■ v r G F(w). We 
wish to show that w'v G W, and thus that v G F(w'); as this is completely symmetric 
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in w and w' , this implies F(w) = F(w'). Now, suppose that w'v G" W. Then there exist 
^o> ^1)^2, •■-,^p2_ 1 > such that 

(^U, ap2 = 1 VaG {0,1,2,. ..,p 2 -l}. 

Since w' and u both belong to W, there is a partition |0, 1, 2, ...,p 2 — 1} = E\\F with 
.E and F both nonempty, such that for all a G E we have a + l a p 2 < d! (d 1 is the length 
of w') and w' c 2 = 1) while for all 6 6 i 7 we have c£' + 1 < b + ^p 2 < df + r and 
v b+e bP 2 -d> = 1- 

Since S p (w') = S p (w), it follows that for each a G J5 there exists i a > such that 
w a+t a p 2 = 1- Thus, if we consider a sequence y € such that = for 

j = 1,2, d + r (such a y exists because wv G W and A(p) is shift invariant), then 
we have y(a + t a p 2 ) = 1 V o 6 £J. For b £ F, since + 1 < 6 + £^p 2 < d' + r we have 
ci+l<(i-(i / + 6 + £ 6 p 2 < d + r and 

y(d - d' + b + £ b p 2 ) = {wv) d _ d , +b+lbp 2 

= vi, + £ bP 2_ d i since d — d' + b + lf,p 2 > d 

= 1 

by the above. However, the fact that S p (w) = S p (w') implies that d = d! modp 2 , so 
d — d! + b + £fep 2 = 6 mod p 2 . Therefore, supp(y) contains the sets {a + t a p 2 : a G E 1 } 
and {<i — <i' + 6 + £bP 2 '■ b € F}, and moreover the reduction modp 2 of the union of these 
sets is E U F = {0, 1, 2, . . . ,p 2 — l}. But this contradicts the fact that y G -4(p). □ 

In fact, the above proof shows that two words w, u appearing in A(p) have the same 
follower set if and only if 7r p (supp(w;)) = 7r p (supp(n)) and \w\ = \u\ modp 2 (where \w\ 
is the length of w), or if w and u both consist entirely of zeros. 

Proposition 2.6. A(pi, . . . ,p r ) is sofic and topologically transitive for any finite col- 
lection of primes pi, ■ ■ ■ ,p r , and therefore is intrinsically ergodic. 

Proof. A(pi, ■ ■ ■ ,p r ) is sofic because it is the intersection of the sofic shifts A(pi) for 
i = 1, . . . ,r, and the intersection of two sofic shifts is again sofic ( |LM| Prop. 3.4.10). 
To prove that it is topologically transitive, let A,B,A',B' be finite subsets of N such 
that U := Ca,b H A(p\, . . . ,p r ) and V := Ca',b' H A(pi, ■ ■ ■ ,p r ) are nonempty. Then 
for each i, the reductions Si := n Pi (A) and S^ := 7r Pi (A') are proper subsets of 7Ljp 2 7L. 
Hence, by the proof of Prop. 12. 5} we may find for each i an m G {0, 1, ... ,p 2 — 1} such 
that Si U (S^ + rii) is again proper. By the Chinese remainder theorem, there exists 
n G N such that n = ni mod p 2 for i = 1, . . . , r. Then, choosing a large m such that 
max A < min(i? / + mp\ ■ ■ ■ p 2 ) and max B < min(j4' + mp\ ■ ■ ■ p 2 ), the set 

U n T -(n+mp r -p T )y = C j4u(A , +n+mp 2... p 2) iBu(B , +n+mp 2... p 2) fl A(pi, ...,p r ) 

is nonempty, as desired. □ 

2.3. Properties of the measures of maximal entropy. In this section we study 
the unique measure of maximal entropy rj r on A(r) := A(pi, . . . ,p r )- We will develop 
explicit formulas describing the behavior of these measures in order to derive properties 
of the full squarefree flow via a limit process. 

Corresponding to the irreducible (= topologically transitive) sofic system A(r) are 
two finite directed labelled graphs G r ,G' r both of whose vertex sets equal the set of 
equivalence classes of finite words appearing in A(r), where two words are considered 
equivalent if and only if their follower sets are equal. Observe that by the proof of Prop. 
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Figure 1. Graph presenting the sofic shift A(p) for p = 2. 



12.51 two words u, v G W(A(r)) are equivalent if and only if \u\ = \v\ mod {p\ ■ ■ -p 2 .) and 
7r Pi (supp(n)) = 7r Pi (supp(u)) for i = 1, 2, . . . ,r, or if u and v both consist entirely of 
zeros. Therefore, the equivalence classes of words in A(r) (and therefore the vertices of 
both of the graphs G r , G' r ) are in bijection with pairs (d, (Si)l =1 ) where d € Z/pf ■ ■ -p^L 
and each Si is a proper subset of Z/p?Z, and the Si are either all empty (corresponding 
to words consisting of only zeros) or all nonempty. 

Letting [it], [v] be vertices of G r , there is a directed edge from [u] to [v] labelled if 
and only if [uO] = [v], and there is an edge labelled 1 if and only if [itl] = [v]. Similarly 
there is a directed edge in G' r from [u] to [v] labelled i £ {0, 1} if and only if [iu] = [v]. 
To illustrate, figure Q] shows a piece of G r in the case where we consider only the single 
prime p = 2. We don't draw the whole graph as it has 2 2 (2 4 — 1) = 60 vertices. 

In this graph, we've made use of the fact that, for instance, [011001] = [011000]. 
Indeed, the words 011001 and 011000 both have length congruent to 2 mod 4 and the 
reduction mod4 of both their supports is the same set, namely {2, 3} mod 4. Observe 
that a sequence of zeros and ones belongs to A(r) if and only if it can be obtained by 
reading in order the labels of the edges in an infinite directed walk on G r . This is the 
same as saying that A(r) is recognizable by the finite automaton G r (as is true of any 
sofic system; see |LM| ) . 

Define the period of A(r) to be the greatest common divisor of the lengths of the 
cycles in the graph G(r) (this definition makes sense for any sofic system). We then 
have the following easy observation, which will be needed later. 

Proposition 2.7. «4.(r) has period pi ■ ■ -p^. 

Proof. We first prove that A.(p) has period p 2 . The graph G p defining this system was 
described in section 12.31 This graph need not be irreducible, but A(p) is presented by 
one of its maximal irreducible components, so we make no distinction between them. 
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One can see from the description of G p that if a path c = i\i2 ■ ■ ■ i n in G p begins and 
ends at the same vertex [w] (i.e. c is a cycle), then we must have \w\ = \w\ + n(modp 2 ), 
and therefore p 2 \n. Conversely, if we choose a vertex [w] and create a directed path in 
G p by beginning at [w] and travelling only along edges labelled 0, then this path will 
return to [w] after exactly p 2 steps. Therefore «4.(p) has period p 2 . 

We can deduce directly from this that A.(r) has period p\ ■ ■ ■ p 2 . By |LM| Thm. 3.4.10, 
the graph representing a finite intersection of sofic shifts is simply the label product of 
the graphs representing each of the shifts, and it's an easy exercise to show that the 
period of the label product of two irreducible graphs is the least common multiple of 
their periods, from which the claim follows. □ 

Let T(r) resp. T'(r) be the adjacency matrix of the finite directed graph G r resp. 
G' r , so T(r) \u\\v\ is the number of directed edges in G r which begin at the vertex [u] and 
end at the vertex [v] (and similarly for T'(r)). By the proof of Thm. 1 of [Wei] we have 
the following facts: letting A r = e htop ^ r '' , A r is an eigenvalue of both T(r) and T'(r). 
We have T(r)T'(r) = T'(r)T(r) and the matrix F(r) := T(r)T'(r) is irreducible, with 
Perron- Frobenius eigenvalue X 2 . There exist strictly positive right and left eigenvectors 
of F(r) both with eigenvalue X 2 , which are unique up to multiplication by positive real 
numbers. Moreover, they're eigenvectors of both T{r) and T'(r), with eigenvalue A r in 
both cases. 

A measure on A(r) is completely determined by its behavior on the cylinder sets 

A(r) w = {x € A(r) : x(i) = Wi for i = 1, . . . , n} 

where w is a length n word appearing in A(r). The measure of maximal entropy on 
A(r) is then given by 

Vr (A(r) w ) = X- n m{r) [w] (2.2) 

where m(r) is the strictly positive right Perron- Frobenius eigenvector for F(r), nor- 
malized so that m(r)r ] + m(r)m = X r (cf. |Weij Eqn. (14)). Our aim is to make this 
expression concrete enough that we can consider the limiting behavior of these measures 
when w is a fixed word in the full squarefree flow A and r — > oo. Such considerations 
allow us to prove in section [3731 that A is intrinsically ergodic. 

We begin by considering a single prime p; the limit as we include more primes is 
of course handled using the Chinese remainder theorem. Let w be an arbitrary word 
appearing in A and define the cylinder set A(p) w as above. The idea is that since r\ v 
is supported on A\(p) by the remarks following Prop. 12.21 we should replace w by 
extensions of it that omit only one residue class (we assume p 2 S> \w\ so this can't 
already be the case). Indeed, let a±, . . . be the classes modp 2 omitted by supp(ui). 
Then Prop. 12.21 implies that up to ^-measure zero we have 

A(p) w = {x € Ai(p) : w C x} 
k 

= {x G A\(p) : w C x and 7r p (supp(x)) = (Z/p 2 Z) \{aj}} 
i=i 

k oo 

= J {x G A{p) : w' C x and m(x) = m}. 

i=l m =l wCw' 
\w'\=p 2 
n p (supp{w'))=(Z/p 2 Z)\{ ai } 

Here, m(x) = min{m : 7r p (supp(2;i • • ■ x mp 2)) = 7r p (supp(x))}, and the notation w C x 
means x(l) • • • x(\w\) = w (and similarly for w C w'). The definition of m(x) as a 
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minimum is needed to produce a disjoint union in the above decomposition. Define 
C^(w,i) = {w' € W(A(p)) : w C w', \w'\ = mp 2 , 7T p (supp(u/)) = Z/p 2 Z\{a,} and 
|7r p (supp(w / 1 • • • w' np2 ))\ < p 2 - 1 for n < m} 
Then the above implies that up to ^-measure zero 

k oo 

•A(P)™ = IIII II A{ P^'- 

i=\m=\ w >e(%i(™,i) 

Therefore, 

k oo 

ri P {A{p) w ) = E E E V p {Ap)w) 
i=l m=l w 'eC^,(w,i) 

and since any word in Cm(w, i) is equivalent to the word Ui = 11 • • ■ 101 • • • 1 which has 
length p 2 and where the occurs at a,-. 12.21 gives 

k oo 

v P (A(p) w ) = E E E v Vm (p)H 

i=l m=l aJ 'gcS 1 (tn,i) 
fc oo 

i=l m=l 

To determine |Cm(«;, «)l i observe that if w' € Cm(w, i), the condition |7r p (supp(u^ • • • w' np2 ))\ 
< p 2 — 1 for n < m in the definition of Cm(w, i) implies that there is a j ^ i such that 
w 'a +ip 2 = 0forl<£<m — 1 while ^. + ( m _ 1 - )p 2 = 1- Thus the m — 2 positions 
Oj + £p 2 , 1 < £ < m — 1 of w' are fixed to be 0; since the position aj + (m — l)p 2 is 
1, there are at most mp 2 — n — (m — 1) — m positions in w' to be freely filled (taking 
into account w C w') once aj is chosen from the k — 1 possibilities, where the second m 
comes from w' having to omit ai(p 2 ) . However, since the reduction of the support of w' 
is (Z/p 2 Z)\{aj}, w' must also have a 1 somewhere in each arithmetic sequence a s (p 2 ) 
for s i,j. As there are m choices of a position for such a 1 for each s ^ i,j, and there 
are k — 2 such s, we see that 

|C& (to,*) | = (fe - 1) m fc-2 2m (p 2 -l)-n-(m.-lMfc-2) 

(the factor of fc — 1 in front comes from choosing a^) and therefore (assuming k > 1) 

fc oo 

^(p)*) = (k - 1) J>(p) W E ^^-ajp^-D—C"*"!)-^). 

i=l m=l 

Now, 

A _ mp2 = ^ 2(1 _ 1/p2) x-^ 2 
_ 2^ m (p 2 ~ 1 ) 

so we get 

fc oo 

Vp (A(p) w ) = (k - l) 2 -«- fc + 3 £ m (p) w ^ m fc " 2 2- m . (2.3) 

j=l m=l 

The sum X^m=i w fc ~ 2 2~ m clearly converges to a value c(k) independent of i; since k 
depends on p and w, so does c(/c). 
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To analyze the terms m(p)\ Vi ], we use the strictly positive left Perron- Frobenius eigen- 
vector m'(p) for F(p), normalized so that m{p) ■ m'(p) = 1. It's related to the right 
eigenvector by the formula ( |KiS| pg. 235) 

/ \ H N ,. ( F (p) np2 )[u][u} , n 

m{p) [u] m (p) M = hm — 3 (2.4) 

for [i/] belonging to the maximal irreducible component of G p . The factor p 2 appears 
because it's the period of A(p) by Prop. 12.71 above, while the 2 in the exponent of A p 
comes from the fact that X 2 , not A p , is an eigenvalue of F(p). 

In our setting, the maximal irreducible component of G p is the induced subgraph 
whose vertices consist of the equivalence classes of words in A(p) whose support omits 
only one residue class modp 2 . If \v] is such an equivalence class, then we see from 
the definition of F(p) and the graphs G p ,G' p that (F(p) np )w,\\ v \ equals the number of 
sequences of pairs ■ ■ ■ , (i np 2,j np 2) such that [j np 2 ■ ■ ■ j\ui\ ■ ■ ■ i np i\ = [v\. Since v 

omits only one residue class modp 2 , we're free to choose such a sequence of pairs so long 
as we get zeros along the length 2np 2 arithmetic sequence on the residue class omitted 
by v, which determines exactly 2n positions in the word j np 2 ■ ■ • j\vi\ ■ ■ ■ i np 2- Hence, the 

total number of such sequences of pairs is 2 2n ( p2 ~ 1 ). Since we also have X p np = 2 2n ^ p2 ~ 1 \ 
(|2.4p gives m(p)\ v vm! (p)\y\ = 1. (|2.3|) therefore becomes 

k 

V P (A(p) w ) = (k — l)2^ k+ \{k)Y,m\p) w ) v (2.5) 

i=l 

To illustrate the idea of our analysis, we calculate (p)n\...ya] (note that this need not 
appear in the above sum). The eigenvector equation m'(p)T(p) = X p m'(p) gives for any 
state [u] 

n»'(p)[„] = A" 1 n ( u 'i u ) m '(p)[v'} ( 2 -6) 
M 

where the sum is over all equivalence classes of words in A(p) and n(v', v) = \{i € {0, 1} : 
[v'i] = This immediately gives 

m'(p) [11...10] = \ p l m'(p) [vi] 

where v\ is the word of length p 2 — 1 consisting only of ones. For < i < p 2 , let V{ be 
the word consisting of p 2 — i ones, and let Uj be the word consisting of p 2 — i zeros; then 
we have [vi] = [V2I] and also [viiiiO] = [vi] = [viu\l\. Therefore 

m'(p)[ii-io] = Xp~ l m'{p)[ vi ] 

= ^p 2 ( m '(p)[v 2 ] + 2m'(p)[„ im] ). (2.7) 

Again by (12 .6p . we have 

m'(p)[ VlUl ] = \~ 1 m'(p)[ VlU2 ] 

= X-(P 2 ^m'(p) [vi] 

so that, by (j2~77]) . 

X p l m'(p) [vi] = X p 2 (m'{p) [v2] +2X p {p2 ~ 1) m'(p) [vi] 
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and solving for m'(p)\ Vl ] yields 

m'ip) [ii.. .10] = K lm \p)[v 1 ] 

Similarly, since 

m'(p)[ V2 ] = A" 1 (m'{p) [v3] + 2m'(p) [v2Ul] ) 
analyzing as above yields 

m'(p) [va] = A^(l - 2\-P 2 )- 1 m'(p) [v3] 

and so 

m / (p)[n...io] = A~ 2 (l - 2\~ p2 )~ l m'{p\ V2] 
= A- 3 (l-2A/)- 2 m'(p) M . 
Continuing this procedure, we find the formula 

m , ( P ) [ n...i 0] = A;^ 2 " 1 )(l - 2A/)-^ 2 - 2 )m / (p) [1] . 

Analyzing the terms m'(p)7 in (|2.5p in the same way gives exactly the same result, 
and so we get (assuming w doesn't omit 1 mod p 2 ) 

k 

Vp (A(p) w ) = (k- l)2~ n ~ k+ \{k) J>'(p)j-J 

i=l 

p 2 -2 



\k - l)kc{k)\f - 1 (l - 2A/) P ~ m'(p)-\. (21 



We now consider the case when r > 1. For z = 1, . . . , r, let Cj be the set of residue 
classes modp 2 omitted by w. Then as above, Prop. 12.31 implies that up to ^-measure 
zero we have 

M r )w = {x E Ai(r) :wCx} 

\x e Ai{r) : w d x and 7r Pi (supp(x)) = (Z/p 2 Z) \{a (0 }} 

(oW,oC 2 ),...,aW)enLi c i 

oo 

= {x £ -4.(f") : w' C x and m(x) = ?n} 

(aW,a( 2 ',...,ffiW) m = 1 wcw' 
ellLi Ci |ui'|=mp?-p? 

7r p .( S uppK))=(Z/p?Z)\{a«} 

where, as in the case of a single prime, m(x) = min{m : 7r Pi (supp(xi • • • x mp 2... p 2)) = 

7r Pi (supp(x)) fori = l,...,r}. For each tuple a = (a^, a^ 2 \ . . . , a^) € YYi=i @i an d 
m > 1 define 



C r m {w,a) = lw' e W(A(r)) : w C w' , \w'\ = mp\ ■ ■ -p 2 , 7r Pi (supp(u/)) = (Z/p 2 Z) \{a®} 
for i = 1, . . . , r and, for n < m,3i such that 

kft(supp(Vi • ■■«4 p 2... p 2))| <Pi ~ *} ■ 
Then by the above, we have up to r/ r -measure zero that 

oo 

r, r {A{r) w )= J2 Vr{A(r) w ,) 
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and since any word in C^w, a) is equivalent to the word v r (a) of length p\- ■ ■ p 2 . which 
has zeros along each arithmetic sequence a^(pf) and ones everywhere else, (j2.2|) gives 



DC 



i lr (A{r) w ) = ™( r )lM«)] E K mPl '" Pr \C r m (w,a)\. 

aellLi m=1 

We now give a lower bound for \C^(w,a)\ that will be useful later. Observe that the 
last condition in the definition of C^(w,a) is satisfied for any n < m as soon as it's 
satisfied for n = m — 1. Choosing a j < r and bV' ^ a® a residue class modp^ to be 

omitted by • • • w^ m _^ p 2 p 2 specifies (m — l)p\ ■ ■ -pj ■ ■ -p 2 . positions in w' to be zero 

in addition to the m[]t=i(^ — 1) + n positions specified by the other conditions in the 
definition of C^w, a). So we find 

\C r m (w,a)\> max 2 mp i-^- m n[=i(pf-l)-(m-l) P ?-p|-p?-n_ 

3=1,-,'' 



„2 „2 



Since A r mPl "' Pr = 2" m ni=i(p?-i), we get 

Vr(A(r) w ) > 2~ n Yl m ( r )lMa)] 



x X~ rnp2l '" p2r max 2 mp i'''^" mn ^ l(p ?~ 1) " (m ~ 1)p i'''^ 2 '''^ 

m=l 

> 2-" max 2^ -Pr Sr (j) V m(r) K(o)] (2.9) 



7=1,. ...r 



where s r (j) = 2 m [^"^- 2 nLiW-i)-rf"^-^ 

We now turn to the terms m(r)[„ r ( a )] . We again use the strictly positive left Perron- 
Frobenius eigenvector m(r) of F(r); just as with (|2.4|) we have 



m(r-)r I/ im'(r-)r 1/ i = lim 



= 1. (2.10) 

To illustrate the analysis of Tn'(r)r v f a \-\, we consider the case when a = (0, 0, ... , 0), so 
u r (a) =: v is the word of length p\ - ■ -p 2 having zeros at the multiples of p 2 for each i 
and ones everywhere else. We proceed as we did with m'{p)^ above. For each s > 
let v s be the first subword of v of length p\ ■ ■ ■ p 2 . — s. We have 

= K 1 ( m '( r )[vi] + 2m '( r )H) 

where u\ is the word consisting of p\ ■ - - p 2 — 1 zeros, and more generally u s is the word 
consisting of p\ ■ ■ • p 2 — s zeros. Now 

m '( r )[u U1 ] = 

= X~ 2 m'(r) [u 



[UU3\ 



= X; p2 - p "- 1 m'(r) [u] 

and so 

m'(r) [v] = - 2K pl - pl )- l m'(r) [vi] . 
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Repeating this for m'(r)[ Ul ], we find as with m'{p)\ Vi \ 

m'(r) M = Ar" P? (l - 2X; pl " P T p2l m'(r) Kl] . 

We've stopped after p\ steps because the last letter of u p 2 is zero (which is not the case 
of any v s for s < pf), so that 




and therefore 

m'(r) M = A^-^l - 2X?*~*)-*m'(T) [v j. 

We continue in this fashion, gaining a factor of A" 1 at each step and a factor of (1 — 
2A r 1 r )~Pi at step s if s not a multiple of pf for any i. The number of such s is 
precisely n[=i(Pi — 1), so we find 

m '(r) [u] = \r Pl - pl+ \l -2A r - p? '" P T nr = l(P? " 1)+1 ^'(0[i]- 
The same formula holds for ^ / (^)[v r (a)l f° r an Y a ) so ([279]) becomes 

r 

»7r(4(r)„) > fci ■ ■ ■ fc r 2-"A^- p? " 1 (l-2A r " Pl "" ? '') n ^^- 1) - 1 m , (r)- 1 ^(^-1)2-^+2^ (fc,)- 

i=l 

(2.11) 

We now use this formula to provide a uniform lower bound for the entries m(r)[ w ] 
of the Perron-Frobenius eigenvector. Such a bound is crucial for proving in the next 
section that A carries a unique measure of maximal entropy. This reflects the impor- 
tance of the rate of convergence (suitably interpreted) of the finite-level systems A(r) 
in establishing properties of the full squarefree flow A. Indeed, any irreducible closed 
subshift whatsoever can be written canonically as a decreasing intersection of decreasing 
irreducible subshifts of finite type. Each of these is necessarily intrinsically ergodic, yet 
the intersection can carry any number of measures of maximal entropy (cf. |Hay| ) since 
the systems need not converge in a uniform way. For another perspective on the way in 
which the rate of convergence of a decreasing sequence of intrinsically ergodic systems 
influences the properties of the limit, see for instance |Gur| . 

Proposition 2.8. There exists a constant L > such that for any r > 1 and any 
word w € W(A(r)) of length at most p\ ■ ■ -p^, we have m{r)y w -\ > L. In particular, if 
w € W(A) (so that w £ W(A(r)) for any r > 1) we have liminf^oo m(r)\ w ] > L. 

Proof. Eq. (|2. 1 1 j) above states that if w is a word in A(r) of length n <p\- ■ ■ p% which 
omits ki residue classes modp? for i = 1, . . . , r then 

r 

Vr(A(r) w ) >ki — k r 2- n \ pi -^- 1 (l-2Ar P? '"^) n ^ (p ?- 1) - 1 m'(r)|- 1] 1 J^-l^-^ 2 ^). 

i=l 

Here s r (ki) = X^m=i( m ^i ' ' ' Pi ' " Pr ~ l) fci_2 2 _mp i"' p ?"' p ^. It can easily be verified that 
2~ ki+2 s r (ki) is increasing with k{ so long as fcj > 2. Therefore, since fcj > 1 for each i 
and r} r (w) = \~ n m{r)^ we have the lower bound 

m(r) M > A^-^-^-^l -2V P?, " rf ) n ^^- 1) -W(r)p ] 1 X} ar (jfc ( = 2). 

i=i 



16 RYAN PECKNER 



By eq. (|2.10p we have m'{r)~^ = m(r)^. Since the word w = 1 of course appears in A, 
the weak* convergence of the rj r and eq. (|2.2p show that the sequence ?n(r)m converges 
to A • i](A\). In particular the sequence is bounded below by a positive number m > 0. 
We have on letting P r = Wi=i(Pi ~ ^) 



OO r ~ 

8r (jj = ^ ^f\p\---Vr-^Pr-p\---v\--V', 



m _ 1 2 pf-p2_2P r -p2... p 2... p 2 _ 1 

so the lower bound becomes 

m(r) M > A? +p? - p? - 1 2" fl (l-2A7 P? "^) P '- 1 m max 2 4^*^-^^-^ 

= A-^l - 2A r T p? - p ') p - 1 m max A ^"^ 2 -(n+p?...rf-2P r -2 P ; ...^)_ 

j'=l,...,r 



Since A r = 2ni=i( 1 ~ 1 /p?) we have for 1 < i < 



A n+p2-p2 2 -(n+p2.-p2-2P r -2p2...p2-p2j = 2n ( n - =i (l_l /p 2 ) _ 1 ) + p r _ p 2... p 2 +2 p r+2p 2... p 2... p ^ 

(2.12) 

Since Ilj=i (1 ~~ 1 /p| ) < 1 an d n < p 2 ■ ■ ■ p 2 . we have for the exponent of the last expression 



J(l - 1/p 2 ) - 1 + P r - p 2 • • -p 2 + 2P r + 2p 2 ---p 2 --- P 2 



> Pi ■ '-Pr !]> - VP?) - 1 + ^ "P? • • -p 2 + 2P r + 2p 2 . . .pj . . . p * 



r 



= 4 J](p 2 - 1) + 2p 2 • • -p) ■ • -p 2 - 2p\ ■ • -p 2 . (2.13) 

I claim that this final expression is positive, which implies that the corresponding power 
of 2 on the right side of eq. (I2.12P is greater than 1. In fact I claim that we already 
have p\ - ■ ■ p 2 < ^Y[ r j=i(.Pj ~ l)j f° r dividing both sides by the term on the left this is 
equivalent to 



> i. 



Since the product is at least 6/tt and ir < 12, this is indeed the case. 

Therefore, eq. (12TT21 shows that A™ +p? ' " p ^2-^\-vl-vl) > i f or { > ^ an d conse- 
quently m(r)\ w ] > A~ 1 (l — 2A r Pl Pr ^i=i^~ 1 )~ 1 m by the previous computations. Now 
it is a simple calculus exercise to show that (1 — 2 x ) x — > 1 as x — > co. Thus if we 
let M = sup x>1 (l — 2 1 ~ x ) x ~ 1 , we have M < oo and m(r)[ w ] > X~ 1 Mm for any r > 1, 
so L = \~ l Mm satisfies the claim. □ 



3. The full squarefree flow 

3.1. Topological properties. We now turn to the full squarefree flow A = P\^LiA(r). 
Prop. 4.4.6 of |LM| states that if X\ 3 X<i 5 X3 ~D ■ ■ ■ are shift spaces whose intersection 
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is X, then ht op (X k ) — > ht op {X). Thus we have by Prop. 

^top(^) = lim/itop(yHpi, ■ ■ ■ iPr)) 



More generally, for any infinite sequence of intgers k = (k p ) € F[p{l' • • • >P 2 ~ 1} ( so 
k p is the number of residue classes omitted modp 2 ) we may define At = ^ p A kp (p) and 
Cj: = n p C kp (p), and then 

^top(C^) = limhtop{C(k u ...,k r )(Pii ■ ■ ■ ,Pr)) 



r 

k 
P 



limlog2 ] [ I 1 

p<p 



log 2 J 



i _ hp 

p2 



P 

This suggests that any invariant measure of maximal entropy on A is concentrated on 
Ax. 

Lemma 3.1. Let 7] be an ergodic measure on A. Then there exists a unique sequence 
k € rip{l' • • • )P 2 — 1} such that rj is supported on Aj:- In particular, any invariant (not 
necessarily ergodic ) measure of maximal entropy on A is supported on A\ . 

Proof. Although one might be tempted to reach the desired conclusion immediately from 
the fact that A is the disjoint union of the A^ over all sequences k, this is invalid since 
there are uncountably many such sequences, so we must argue differently. Let r/ be an 
ergodic measure on A, and let X = supp(?7). For each x E X and each prime p let 
k p (x) = p 2 — |7T p (supp(x))|. k p : X — > N is measurable, and we have for any p 

X = XI X k (p) 

k=l 

where X k (p) = {x € X : k p (x) = k] = X n Ak(p)- Since n is ergodic with n(X) = 1, 
and the X k {p) are invariant (up to absolute measure zero) and mutually disjoint, there 
exists a unique kx{p) £ {1, • • • ,p 2 — 1} such that n(X kx ^) = 1. As this is true for all 
p, and since a countable intersection of sets of full measure also has full measure, we 
have n(X n A^) = n (n p X fcx ( p )) = 1 where k = (kx(p)), which proves the first claim. 
The second claim follows from ergodic decomposition as in the remarks following Prop. 

□ 



We now prove that A isn't sofic by bringing out the role of periodic points. By 
the remarks preceding Thm. 6.1.9 of [Kit] . the set of periodic points in an irreducible 
(=topologically transitive) sofic system is dense. We will prove that A is irreducible 
but that it contains only one periodic point, so it certainly can't be sofic. Actually, 
since irreducibility of a subshift is equivalent to topological transitivity, and A is by 
definition the closure of the orbit of the point (/U 2 (n)) n >i e Q( 2 \ it is immediate that A 
is irreducible. Nonetheless, we present a proof that uses only the definition of A as the 
space of admissible sequences. 
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Proposition 3.2. A is irreducible. 

Proof. First, observe that if w is a finite word on {0, 1}, then w appears in A if and only 
if 7T p (supp(u>)) is a proper subset of 7Ljp 2 7L for every p. Indeed, the analagous claim for 
A(pi, . . . ,p r ) is immediate from the Chinese remainder theorem, and it then follows for 
the full squarefree flow because 0( 2 ) is compact. To be exact, if 7r p (supp(io)) is proper for 
every prime p, then for each r we may find some y r € A(pi, . . . ,p r ) such that w appears 
as the first subword of y r . Since is compact, some subsequence of y r converges 
to an element y of f2( 2 ), which must in fact belong to A because A(pi, ■ ■ ■ ,Pr+i) C 
A(pi, ■ ■ ■ ,p r ) for all r > 1. Since w is clearly the first subword of y, we see that 
w 6 W(A) as desired. 

As before, let A, B, A' , B' be finite subsets of N such that U := Ca,b H A and V := 
Ca',b' H .4 are both nonempty. The key point is that since A and A' are finite, the 
reduction modp 2 of their union will automatically be a proper subset of r L/p 2 'L if p 2 > 
\A\ + \ A'\, so we only need to consider primes smaller than this fixed bound. Let p\ < 
p 2 < . . . < p r be the primes whose squares are less than or equal to \A\ + \A'\, and choose 
a large m such that max A < min(i?' + mp\ ■ ■ ■ p 2 ) and max^4' < min(i? + mp 2 ■ ■ -p 2 )- 
As per the proof of Prop. 12.61 we may find some n € N such that vr Pi (^4 U {A' + n)) = 
TTp i (A U (A' + n + mp 2 ■ ■ ■ p 2 )) is proper for i = 1, . . . , r. On the other hand, for p > p r we 
havep 2 > + = \A\ + \A' +n+mp\ ■ ■ -p 2 \. Therefore, ir p (Au(A' +n+mpj cdotsp 2 )) 
is proper for every prime p, so 

U f1 T ( n+m Pl Pr) = C Au ( A , +n+mp 2... :p 2) iBu ( B / +rl+m p2... p 2) n A 

is nonempty, as desired. 

□ 

To study the periodic points of A, we first examine those of the systems A(pi, . . . ,p r ). 
We begin with the case of a single prime. 

Proposition 3.3. Let p be any prime, and let x be a periodic point in A(p) not equal 
to (0, 0, 0, ... ). Then the period of x is a multiple of p. 

Proof. Let m be the period of x. Then there exists a word w = w± ■ ■ ■ w m such that 
x = (wi,W2, • • • , w m , wi,W2, • • • )• Choose a € {1, 2, . . . ,p 2 } such that a(p 2 ) n supp(x) = 
0, and choose any b G {1,2,..., m} such that Wb = 1 (such a b exists because x ^ 
(0, 0, 0, . . . )). If (m,p 2 ) = 1 (which is the case if and only if m is not a multiple of p), 
then by the Chinese remainder theorem, we may find some n G N such that n = a mod p 2 
and n = b mod m. Therefore, there exist £,£' > such that n = a + £p 2 = b + £'m, 
which implies both 

x(n) = x(a + £p 2 ) = since a + £p 2 € a(p 2 ) 
and, since x has period m, 

x(n) = x(b + £'m) = x{b) = Wf, = 1 

which is a contradiction. 

□ 

This may be extended to a finite set of primes in the obvious way. 

Proposition 3.4. Let p\,...,p r be distinct primes, and let x be a periodic point in 
A{p\, . . . ,p r ) not equal to (0,0, 0, . . . ). Then the period of x is a multiple of p\ ■ ■ ■ p r . 
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Proof. Since A(pi, ■ ■ ■ ,p r ) = r\ r i=l A{pi), x is a periodic point of the system A(pi) for 
i = 1, . . . ,r, and its periods in each of these systems are all equal to its period m in 
A(pi, . . . ,p r )- Therefore, by the above proposition, m is a multiple of pi for i = 1, . . . , r, 
so m is also a multiple of pi ■ ■ -p r . □ 

Prop. 13.21 and the above remarks now imply the following. 

Corollary 3.5. (0, 0, 0, ... ) is the only periodic point of A. In particular, A is not sofic. 

Proof. If x is a periodic point of A, then it is also a periodic point of A(pi, ■ ■ ■ ,p r ) for 
all r > 1. By the above proposition, the period of x must then be a multiple of p± ■ ■ ■ p r 
for all r > 1, which is absurd. □ 

To summarize, A is irreducible and neither sofic nor minimal since it contains the 
fixed point (0,0,0, ... ). Now, it is well-known that the set of transitive points (points 
with dense orbit) in an irreducible system is a dense G$, while the set of intransitive 
points is either empty or dense (cf. }KS] Thm. 4.3.1). Since we know that A contains 
the intransitive point (0, 0, 0, ... ), we get the following. 

Corollary 3.6. Let T be the set of intransitive points of A. 

(1) T is a dense, meagre F a set. 

(2) The closure of the T -orbit of any point in J r \{(0, 0, 0, . . . )} is an infinite proper 
subshift of A. 

One suspects that J- carries some meaningful information relating to arithmetic pro- 
gressions modulo squares of primes, but we won't pursue that possibility here. 

3.2. Approximating A by the finite-level systems. Although A is not sofic (and 
doesn't fall into any class of intrinsically ergodic systems we're aware of), we are able to 
prove in the next section that A is intrinsically ergodic by showing that it is (in more 
than one sense) the limit of the intrinsically ergodic systems A{r). To begin, note that 
some subsequence of the measures r) r converges weak* to a measure r\ on Q,( 2 \ which 
automatically is supported on A and has maximal entropy by the upper semi-continuity 
of the entropy function. To prove that n is the only measure of maximal entropy on A, 
we will exhibit it as a more refined kind of limit of the measures ij r , namely their limit 
under Ornstein's d metric. Good general references on the d-distance are [D], |Gla] Ch. 
15.7, and [Sh] Ch. 1.9. 

We should point out that the facts to follow only apply to invertible processes, i.e. 
two-sided shifts. There are extensions of Ornstein's theory to one-sided shifts (e.g. [HRJ) 
that would apply to the systems we consider. However, we will not need these, since 
there is a natural entropy-preserving bijection between invariant measures on a one- 
sided shift and those on its invertible extension ( |CT| Prop. 2.1). In particular intrinsic 
ergodicity for the two are equivalent, so for our applications it suffices to consider the 
two-sided version of A., i.e. in the following discussion we consider 6i-infinite sequences 
of 0's and l's with admissible support. By abuse of notation we still refer to this subshift 
as ^4, and the associated maximal entropy measure system as A (and similarly for the 
finite level systems A(r)). 

We follow |Gla| §15.7. A pair (X, a) where X = (X,S,n) is a measure-preserving 
dynamical system and a = {Aj}i<j<^ is a finite partition of X into disjoint measurable 
subsets is called a process. Associated to a process is a homomorphism (j) a : X — > 
T,{<f) a )*rr) defined by 4> a x(n) = i 44> S n x € Aj. We call p(X.,a) := (4> a )*r] the 
symbolic representation measure of the process. 
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Given two processes (X.j,ctj),j = 1,2 where both partitions have the same number 
of sets £, define the d-distance between them by 

J((Xi,ai), (X 2 ,a 2 )) = inf {A(a x xI 2 AI 1 x« 2 ):^ JiVl^m)} 

where J(f7i,?7 2 ) is the set of joinings of the two systems. One can show that 

(2((Xi,ai), (X 2 ,a 2 )) = d{p\,p2) 

where p\ = p(X.i,a\), p 2 = (X 2 ,a 2 ), and we note that p\ and /3 2 are both measures on 

n(£). 

d is a complete metric on the set of T-invariant measures Mt(Q), and the topology 
it induces is (much) stronger than the weak* topology ( |Gla| Prop. 15.20). In order to 
prove that rj is unique (which we accomplish in the next section), we must first show 
that it is ergodic in order to follow the outline of Weiss's argument for the uniqueness 
of the measure of maximal entropy on an irreducible sofic shift. This is supplied by the 
following fact. 

Proposition 3.7 ( |Shj Thm. 1.9.15). A d-limit of ergodic measures on a shift space is 
ergodic. 

It is far from true that a weak* limit of ergodic measures is ergodic in general; in 
fact, the ergodic measures on a shift space are weak* dense in the space of invariant 
measures. Since each r] r is ergodic, we naturally aim to prove the following. 

Theorem 3.8. In the above notation, we have d(jj r ,rj) — > as r — > 00. Therefore, 77 is 
ergodic by the above proposition. 

Actually, we will show that the sequence of processes «A(r) (with the time-one par- 
tition) is Cauchy under d, and therefore convergent since d is complete. It follows that 
the entire sequence of measures rj r converges weak*, and thus its (d and weak*) limit 
must equal 77. 

To do this, we will produce for each pair of integers r < s a joining p rs of the systems 
A.(r) and «4.(s) such that 

Prs (Co x Cf II C[ x Cg) -> as r, s ->■ 00 

where Cg = {x € A(r) : x\ = 0},Cf = {x G A(s) : x± = 1} and similarly for C[,Cg. 
Since Cg x Cf II Cf x Cg = a r x A(s) A A(r) x a s , where a r = {Cg, Cf}, a s = {Cg, Cf } 
are the time-one partitions of A{r) resp. A(s), the definition of the d distance as an 
infimum over joinings implies that 

d(A(r), A(s)) < p rs {C r x C{ II C[ x Cg) -> as r, s -> 00 

which yields the desired conclusion. 

Let's first sketch the idea of how to construct the joining p rs . A sequence x £ O^ 2 ^ 
belongs to A(r) if and only if x omits at least one arithmetic progression modulo pf for 
i = 1, . . . , r, i.e. if and only if there exists a £ {1,2,... ,pf} such that x a+ip 2 = for any 
£ > 0. Outside of these arithmetic progressions, however, the entries of x may be chosen 
with complete freedom. Thus, suppose we begin with an r-tuple (ffi, #2, • • • ,5r) S := 
TLjp\L x • • • x TLjp^L and any sequence x' £ SI 2 . To define a sequence in A(r) from this 
information, we start reading the entries of x' until we reach a position congruent to one 
of the gi mod pf, where we insert a zero. We then continue to read x' from where we left 
off until we reach another position congruent to some (likely to be different) gj modp 2 , 
where we again insert a zero, and so on. For example, in the case r = 1, suppose we are 
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given the residue class 3 mod 4 and the sequence x G Q,^ shown below. Then modifying 
x as just described will yield the sequence x' below: 

x = 01110110101101- •■ 
x' = 0101100110010101010 • • • 

where the underlined zeros have been inserted to force x' to omit the progression 3 mod 4. 
We can formalize this as follows. Given any g = (gi, ■ ■ ■ ,g r ) G Z r , define a map 

x fi(2) ^ n (2) by 



by a g (n) = \{m < n : m = gi mod pf for some i < r}\. Then define 



<t>r{g,x) 



if n = gi mod pf for some i < r 



•£"n—a g (n) 



else 



It is straightforward to verify that (fr r is continuous and intertwines the Z-actions on the 
two systems when we consider translation by (— 1, — 1, . . . , — 1) on Z r . Moreover, the 
image of (fr r equals A(r) by construction. Therefore <p r is a topological factor map from 
Z r x fi( 2 ) onto A(r). 

The following proposition reflects the fact that the entries of a member of A(r) may 
be chosen freely once it's omitted at least one arithmetic progression modp 2 for each i. 

Proposition 3.9. Define the measure v r := m r x B(^, |) on Z r x where m r is 
the counting measure and B(^,^) is the maximum entropy Bernoulli measure on 
Then ((j> r )*v r = rj r; the measure of maximal entropy on A(r). 

Proof. Let oj = ((fir)*^', this is an invariant measure on A{r). We will show that the 
measure entropy ^(^(r)) = h top (A(r)), which implies oj = i] r since rj r is the only 
measure of maximal entropy on A(r) by prop. 12.61 

To this end, let n > 1 and consider a word w G W(A(r)) of length n. Let A w = A{r) w 
be the associated cylinder set and let B w = (j)~ (A w ), so that oj(A w ) = v r (B w ). Define 

Iq = {m < n : w m = 0} 
I\ = {m < n : w rn = 1}. 

By definition, we have (g, x) £ B w if and only if (f) r (g, x) m = w m for m = 1, . . . , n. Thus 
if m G Iq, we must have x m _ Qg ( m ) = or m = gi mod p\ for some i < r, while if m G I\ 
we must have x m _ ag ( m ) = 1 and m ^ gi mod pf for any i < r. Defining 5 g (m) = 1 if 
and only if m = gi modp 2 for some i < r and 5 g (m) = else, it follows that we have 

B w = | (g, x) G Z r x (2) : x m _ ag ( m ) = Vm G Iq with 5 g (m) = 0, 

x m-a g (m) = 1 ^ 7T1 ^ ^lj an d gi ^ m mod p 2 for any m G < r} . 

Assume without loss of generality that n, the length of w, has form = kpf ■ ■ -p 2 . We 
will show that for any g G Z s we have 

{m — a g (m) : m G [l, /cp^ • • • pf] with 5 g (m) = 0} = [l, — 1) • • • (pf — 1)] (3.1) 

where [a, 6] denotes the set of integers between a and b. For this, define a map f3 g : N — > N 
by P g (m) = min{£ > m : 5 g (£) = 0}. I claim that f3 g (m) — a g (j3 g (m)) = m + 1 — a 5 (m) 
for any m. Indeed, the number of i strictly between m and f3 g (m) with <5 5 (-^) = 1 is 
by definition a g ((3 g (m)) — a g (m). Since P g (m) by construction equals m + 1 plus the 
number of such £, we see that f3 g (m) = m + 1 + a g (/3 g (m)) — a g (m) as claimed. 
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Define numbers fa for k = 1, . . . ,n — a g (n) by fa = 1 if S g (l) = and fa = fa(l) 
otherwise, then set fa = fa(fa-i) for k > 1. Then it is clear that 5 g (m) = if and only 
if there exists some k such that m = fa. Therefore 

{m — OL g {m) : m G [l, fcp 2 • • -p 2 /] with 6 g (m) = 0} 

= - OC g (fa),fa - Oig(fa), . . . ,fa-a g ( n ) ~ Ol g {fa- ag (n))} ■ 

By the above remarks, we have fa+\ = fa + 1 + a g (fa + i) — a g (fa), which is equivalent 
to (fa+i - a g {fa + i)) - (fa - a g {fa)) = 1 for all k; therefore 

{fa - a g (fa),fa - a g (fa), . . .,P n -a g (n) ~ a g(Pn-a g (n))} =[!,«- a g{ n )\- 

Since n = kp\ ■ ■ -pi and n — a g (n) equals the number of m < n with 5 g (m) = 0, 
we see by inclusion-exclusion and the Chinese remainder theorem that n — a g (n) = 
k(p\ — 1) • • • — 1), which implies the claim. 
By the above decomposition of B w we have 

where G± = {g G Z s : gi ^ m mod pf for any m G < r} and C Qg C is the 
subset {x G £l( 2 ) : a; m _ Q ( m ) = V?n G Jo with S g (m) = and x m _ ag ( m ) = 1 Vm G Ii}. 
Therefore, by eq. (I3.ip and the definition of the Bernoulli measure we have 



v r (B v 

<?6Gi 



1 


Pi 




Pr 




1 




pi 




pi 




1 




pi 




pi 



1=1 



where u(ui, i) is the number of distinct residue classes modulo pf defined by l\. Letting 
a be the time one partition on A(r) and n = kp\ ■ ■ ■ pi as above, it follows that 



w£W(A{r)) 
\w\=n 



w&W{A(r)) i=l 
\w\=n 



k Y[(p 2 i - 1) log 2 - J] log(l - uK i)/p 2 



i=i i=i 
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We will focus on the case when u(w, i) = pf — 1 for i 



l,...,r. Indeed, we see from 



the above and the fact that Yli=i l°g(l ~~ U ( w ^)/Pi) < that 



E 



n(i - {pi - i)/pt 



weW(A{r)) 
\w\=n 
u(w,i)=p^ — l for 8=1, 



8=1 



fcII(p?-l)log2 



i=l 



2 -fcnr=i(p?-i) 



ui£W(4(r)) 
u(w,i)=p 2 — 1 for i=l, 



8=1 



8=1 



E 



i»6W(^(r)) 
|«j|=n 

u(w,i)=p? — 1 for i=l,...,r 



^2 fe nLi(p? x W(.A(r)) : \w\ = kp\---p 2 and u(io,i) = p\ - 1 for i = 1, . . . , r}| 



log 2 J] (1 



i=i 



P? 



Since by definition H u (a 1 r )/(kpf ■ ■ -p r ) — >• /^(^(r)) as -4 oo, and since 
^(^(r)) < /itop(«4(^)) by the variational principle, we see by recognizing h top (A(r)) as 
the last factor in the product above that we'll be done if we can show 

lim 2" fc nr=i0i-i) -\{w £ W(A(r)) : \w\ = kp\---p 2 and u(w,i) = p 2 - 1 for i = 1, . . . , r} 



k— >oo 



\ 2 2 



(3.2) 



First consider the case r = 1, so that we are considering just one prime p. Let 

W^fc) = {w € W(^4(p)) : H = /cp 2 and u(w) = p 2 - l} , 

that is to say, W-^fc) consists of those words of length kp 2 whose support omits exactly 
one residue class modp 2 . We have 

w\k)= n wKk) 

where Wg(k) is the set of length kp 2 words that omit only g mod p 2 from their support. 
Now, for each g £ Z/p 2 Z let 

W g {k) = {w € W{A{p)) : \w\ = kp 2 and g £ 7r p (supp(u;))}. 

That is, W g (k) consists of all length kp 2 words that omit g modp 2 (but may omit other 
residue classes as well). Then we have W g (k) = Wg(k) TJ V g (k), where Wg(k) is as above 
and V g (k) is the set of length kp 2 words that omit g and at least one other progression 
modp 2 . 

Observe that \W g (k)\ = 2 k P 2 ~ k = 2 fc (P 2 " 1 ), since W g (k) is defined solely by the condi- 
tions that w have length kp 2 and have zeros in the positions g+£p 2 for i = 0, 1, . . . , k — 1, 
where g is the smallest integer representative of g 6 Z/p 2 Z. On the other hand, we have 

V g (k)= (J {weW g (k):h?Tr p (mpp(w))}. 
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For any h E Z/p 2 Z,/i 7^ 5, the condition that w belong to the set {w G W g (k) : h 
7T p (supp(w;))} determines 2k entries of w; namely, that w must have zeros in the positions 
g + tp 2 and h + lp 2 for ^ = 0,1,... ,/c — 1. The other entries of w may be chosen with 
complete freedom; therefore \{w G W g (k) : h ^ 7r p (supp(u;))}| = 2 kp ~ 2k = 2 k ( p ~ 2 \ As 
this is independent of the choice of h we get the estimate 

\V g {k)\ < (p 2 - l)2 k ^ 2 ~ 2 \ 

Hence, the decomposition W g (k) = W g (k) V g (k) for any g G Z/p 2 Z yields the lower 
bound 

\W*(k)\ = \W g (k)\ - \V g (k)\ > 2 fc ( p2 " 1 ) - (p 2 - l)2 k( - p2 ~V 
and it follows that for any g we have 

lim 2- k ^ 2 -^\Wl{k)\ > 1. 
k— too 

Since this holds for any g, we find 

lim 2~ k{p2 - 1 '>\W 1 (k)\ = V lim 2~ k{ P 2 -^\W^k)\ > p 2 

k— >oo *■ — ' k— >oo 



which proves the inequality fj3.2j) in this case. 

In fact, we can deduce the inequality (13.2|) for any r > 1 from this using induction on 
r and the Chinese remainder theorem. Indeed, let 

W}(k) = {w G W(A(r)) : |to| = kp\ ■ ■ ■ p 2 and u(w, i) = p 2 — 1 for i = 1, . . . , r} . 

Our aim is to show that lim^oo 2~ k ^=^~ l ^\W}{k)\ > p\ ■ ■ -p 2 . 

A word w of length kp\ ■ ■ ■ p 2 belongs to W}{k) if and only if w belongs to W^_i{kp 2 ) 
and the support of w omits exactly one residue class modp 2 . The latter condition forces 
w to have zeros in the kp\ • • • p 2 _\ positions g+lp 2 for I = 0, 1, . . . , kp\ ■ ■ ■ p 2 _i — 1, where 
ir r (g) G Z/p 2 Z is the unique class omitted by w m.od p 2 . By the Chinese remainder 
theorem, this implies that 

IW^AOI = pp-^^-^lW^kp 2 )] + O(l). 

Consequently 

2-* nr=i (pf-i) \w}{k)\ = 2~ kp2 nr=i &>?-i)+* nr=i #-1) ^ w i^ | 

= p p 2 - & ^ ni=i I W^iCfc^) I + o(l) as A; -> 00. 

The claim now follows by induction on r, the case r = 1 having been shown above. 
Indeed, assuming the lower bound (|3.2p holds for r — 1, we have 



lim 2- fc n[ =1 (p?-i) . \wXk)\ = lim p^r^nUG*- 1 ) IW^fcp 

\ 2 2 2 



Now it follows from the computations preceding the inequality (|3.2p that /i^(^4(r)) = 
htop(A(r)), which shows that 10 = rj r , as desired. □ 

Now suppose s > r, so we have maps (j) s : Z s x fi( 2 ) — > ^4(s) and (j) r : Z r x — s> A{r). 
Since s > r, Z r is naturally a factor of Z s under the map which ignores the last s — r 
coordinates. Therefore, we may pull back 4> r to a map <j> r : Z s x n( 2 ) -> .A(r). This 
remains a topological factor map, and we clearly have (<t>r)*v s = r/ r . 

Denote by X s the measure-preserving system (Z s x fi( 2 ),i/ s ) with the action being 
the product of translation by (—1, —1, . . . , —1) on Z s with the left shift on f^ 2 ). Then 
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the above constructions imply that X s is an extension of both Air) and A(s) under the 
maps 4> r resp. (f) s . Thus, we have the joining 

p rs := {(j) r x (fr s )*V s 

of A{r) and A{s). 

We proceed to estimate p rs {C r Q x Cf II C{ x Cg) = p rs (C£ x Cf) + p rs {C{ x C5 5 ). We 
have by definition 

Prs (c r x cf) = u s {Q> r x ^r^cs x Cf)). 

Suppose (g,x) G (0 r x <^> s ) _1 (Cq x Cf). To have cp r (g,x) G Cq we must have xi = 
or <7j = 1 mod pf for some i < r, while to have cp s (g,x) G Cf we must have x\ = 1 
and ^ 1 mod pf for z = 1, . . . , s. These conditions are clearly incompatible; therefore 
ft. x <j) s )- x {C r Q x Cf) = and p rs (C r Q x Cf) = 0. 
We now turn to 

Prs (C{ x C S ) = v s {$ r x S )- 1 (C[ x C S )). 

Suppose (g,x) G (<^> r x S ) _1 (C[ x Co). To have cj) r (g,x) G C[ we must have xi = 1 and 
(?.; ^ 1 mod pf for i = 1, . . . , r (but it is important to note that we can have g% = \ mod 
for i> r\ while to have (f) s (g,x) G Cq we must have x± = or gi = 1 modp? for some 
i < s. The only way these conditions can be met simultaneously is if x\ = 1 and 
g-i = 1 mod p? for some i > r. Therefore 

(^x0 s )- 1 (C[xC o s ) = G M xd 

where Ci = {x G 0( 2 ) : xi = 1} and 

G( r s ] = {<? G Z s : gi = 1 mod pf for some r < i < s} . 

We have 

s 

G {r,s] = U {ff^s : ft El mod P?} 



and therefore 



i=r+l 



|G(r, S ]l <Pl--'Pf Yl P r+1 ■■■Pi---P 2 s- 

i=r+l 



Since -B(^, ^)(Ci) = 1/2, we finally obtain 



Prs {C{ x C S ) = !/ s ((0 r x (j> s )-\Cl x C S )) 
1 



2p\ ■■■p 



2 I \?A I 



1 s 

z Pr+l Ps 



i=r+l 
1 

+ ^5— + ••• + 



2 VPr+l Pr+2 Pi 

and this converges to as r, s — > oo because the sum of the reciprocals of the squares 
converges. This proves thm. 13.81 
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3.3. Intrinsic ergodicity. We now come to the proof of our main theorem. 

Theorem 3.10. ij is the only measure of maximal entropy on A. 

Our proof of this theorem relies on the following lemma of Weiss. Recall that the 
intrinsic ergodicity of A is equivalent to that of its two-sided extension, so by abuse of 
notation we may consider the inverse of the shift acting on A. With this understood, 
for a partition a and n > 1 we use the notation a™ n = V™ = _ n T _:, a. 

Lemma 3.11 ( [Wei] Lemma 2). Let Y be a closed invariant subset of fi" with topo- 
logical entropy log A, and let a be the time one partition ofY. Suppose rj is an ergodic 
measure on Y such that h^iY) = ht op (Y) = log A, and suppose there is a constant c > 
such that for any atom A € a™ n we have rj(A) > c~ A _2n_1 . Then Y is intrinsically 
ergodic. 

Thus we need to show that r/ satisfies the conditions of this lemma; since we proved 
in the last section that n is ergodic, we need only prove the existence of a constant c > 
satisfying the stated lower bound. This bound is often referred to as the "lower Gibbs 
property" for the measure 7], cf. |CTj §5.3 (I thank Dan Thompson for this terminology), 
and we see that it is powerful enough to imply the uniqueness of an ergodic measure of 
maximal entropy. 

Recall the definition of the finite-level measures rj r (eq. (|2.2p ): given a word w € 
W{A(r)) of length n we set 

Vr(A(r) w ) = \- n m(r) [w] 

where m(r) is the strictly positive right Perron-Frobenius eigenvector for the matrix F(r) 
described in sec. 12.31 normalized so that m(r)[ ] + m(r)m = A r . The strict positivity 

of m(r) then implies that the quantity Ar"' • i] r (A(r) w ) is uniformly bounded below as 
w varies over the words in A(r). Thus rj r satisfies the lower Gibbs bound, verifying the 
fact that r] r is the only measure of maximal entropy on A(r) (this is precisely why an 
irreducible sofic system always carries a unique measure of maximal entropy) . 

In our specific setting, we have uniform control over the lower Gibbs bound for A(r) 
as r — > oo; namely, by prop. 12.81 we know there exists a constant L > such that for 
any r > 1 and any word w E W(A(r)) of length 2n + 1 < p\ ■ ■ ■ p%, we have 

\l n+l -r, r {A{r) w )>L. (3.3) 

Now, we showed in the last section that n is the d- limit of the measures rj r , so all 
the more so is their limit in the weak* topology. Consequently, if w is a word in A 
of length 2n + 1, we clearly have Aw = n^ 1 ^4(r) u , and, by general measure theory, 
f]{Aw) = limr-Kx) Vr(Aw) = lim r -^oo^r (A(r) w ). Therefore eq. (13.31) yields 

X 2n+1 ■ n{A w ) = lim \ 2 r n+l ■ Vr (A(r) w ) > L 

r— >oo 

since w is fixed and we have 2n + 1 < p\ ■ ■ ■ for all sufficiently large r. 

Therefore n has the lower Gibbs property. Since it is ergodic by 13.81 Weiss's lemma 
then implies that n is the only measure of maximal entropy on A. This completes the 
proof of EH 

3.4. Structure of the maximal entropy system. For each r > 1, let A.(r) be the 
measure-preserving dynamical system (A(r),T,rj r ) and let A. = (A,T,r)) where r\ is 
the unique measure of maximal entropy on A. By the main result of (T], the class of 
processes of form (Z x B, S x T,m x fS) where (Z, S, m) is ergodic of entropy zero and 
(B,T,/3) is a Bernoulli shift is closed under taking factors. We refer to such processes 
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as entropy zero x Bernoulli for obvious reasons. Such a system has Z as its Pinsker 
factor. In our setting, 7L T (equipped with translation by ( — 1, —1, —1, . . . )) is naturally 
a factor of A(r) under the map that sends a sequence x € A\{r) to the unique residue 
class modpf • • -p 2 r omitted by its support (recall that rj r {A\{r)) = 1). It follows that 7L T 
is a factor of the Pinsker factor of A{r). 

By prop. 13.91 A{r) is a factor of Z r x B, where B is the Bernoulli system of entropy 
log 2. Moreover, by the fundamental work of Ornstein, a Bernoulli shift is classified up 
to isomorphism by its entropy [Dj. Combining this with Thouvenot's theorem and the 
observations above yields the following. 

Lemma 3.12. The map <j) r yields an isomorphism A{r) = 7L T x B r , where B r is the 
Bernoulli system of entropy log 2 ni=i U ~~ ^/pf) ■ ^ n particular, 7L r is the Pinsker factor 
of A{r). 

By thm. 13.81 the processes A(r) (with their time zero partitions) converge under d 
to the process A. By the main result of [ShTj . the class of entropy zero x Bernoulli 
processes is closed under taking d-limits. In our case, the entropy zero components 
of the systems A{r) have discrete spectrum, which must increase to the spectrum of 
the entropy zero component of the d-limit (cf. |RS| ). This finally yields the following 
description of the maximal entropy measure system for the squarefree flow. 

Theorem 3.13. Let K, be the Kronecker system (j\ p Z/p 2 Z, T(_i_i_i... ), VHaarJ ■ Then 
we have an isomorphism A = K, x B sq , where B sq is the Bernoulli system of entropy 
(6/-7T 2 ) log 2. In particular, K is the Pinsker factor of A. 

3.5. Conclusion. To summarize, we have shown that there is a unique measure of 
maximal entropy on the squarefree flow and described the structure of the associated 
maximal entropy measure-preserving system. One may naturally wonder whether these 
results could be obtained in a more hands-off way by proving that the squarefree flow 
belongs to a class of systems that are known to be intrinsically ergodic. It's easy to 
see that this won't be one of the common classes (sofic shifts, /3-shifts, S"-gap shifts, 
etc.), one reason being that A has only one periodic point. I am unaware of any classes 
very different from these for which intrinsic ergodicity is known. Alternately, given its 
arithmetic significance, the squarefree flow may be the first example of a new class of 
intrinsically ergodic systems that deserve to have their properties formally extracted 
and defined. 
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